Performance Bounds for Column-block Partitioning of Parallel Gaussian Elimination and Gauss-jordan Methods
نویسنده
چکیده
Column-block partitioning is commonly used in the parallelization of Gaussian-Elimination(GE) and Gauss-Jordan(GJ) algorithms. It is therefore of interest to know performance bounds of such partitioning on scalable distributed-memory parallel architectures. In this paper, we use a graph-theoretic approach in deriving asymptotic performance lower bounds of column-block partitioning for both GE and GJ. The The content of the information herein does not necessarily reeect the position of the Government and oocial endorsement should not be inferred. 1 new contribution is the incorporation of communication cost in the analysis which results in the derivation of sharper lower bounds. We use our scheduling system PYRROS to experimentally compare the actual run time performance with that derived by these lower bounds on the nCUBE-2 hypercube parallel machine.
منابع مشابه
Gauss-Jordan Elimination for Matrices Represented as Functions
This theory provides a compact formulation of Gauss-Jordan elimination for matrices represented as functions. Its distinctive feature is succinctness. It is not meant for large computations. 1 Gauss-Jordan elimination algorithm theory Gauss-Jordan-Elim-Fun imports Main begin Matrices are functions: type-synonym ′a matrix = nat ⇒ nat ⇒ ′a In order to restrict to finite matrices, a matrix is usua...
متن کاملStability of Gauss-Huard Elimination for Solving Linear Systems
This paper considers elimination methods to solve dense linear systems, in particular a variant due to Huard of Gaussian elimination [13]. This variant reduces the system to an equivalent diagonal system just as GaussJordan elimination, but does not require more floating-point operations than Gaussian elimination. Huard's method may be advantageous for use in computers with hierarchical memory,...
متن کاملSingly-Bordered Block-Diagonal Form for Minimal Problem Solvers
The Gröbner basis method for solving systems of polynomial equations became very popular in the computer vision community as it helps to find fast and numerically stable solutions to difficult problems. In this paper, we present a method that potentially significantly speeds up Gröbner basis solvers. We show that the elimination template matrices used in these solvers are usually quite sparse a...
متن کاملMeasuring the Overhead of Intel C++ CnC over TBB for Gauss-Jordan Elimination
The most efficient parallel execution of dense liner algebra algorithms is to build and evaluate the task graph constrained only by the data dependencies between the tasks. Both Intel C++ Concurrent Collections (CnC) and Threading Building Blocks (TBB) libraries allow such task-based parallel programming. In this paper, we first analyze all the three types of data dependencies in the tiled in-p...
متن کاملMinimizing communication overhead for matrix inversion algorithms on hypercubes
The mirin contribution of this report is the development of novel algorithms {that make efficient use of the communication system in distributed memory architectures with plrocessing elements interconnected by a hypercube network. These algorithms achieve almost optirr~al overlap of communicatior~ delays by computation, leading to a minimization of communicatioi~ overhead. Rigorous ana1yt:ical ...
متن کامل